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Immunological characterization of papain -induced fragments of Clostridium 
botulinum type A neurotoxin and interaction of the fragments with brain 
synaptosomes . 

Kozaki S ; Miki A; Kamata Y; Ogasawara J; Sakaguchi G 

Department of Veterinary Science, College of Agriculture, University of 
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After treatment of Clostridium botulinum type A neurotoxin with papain, 
three fragments (Mrs, 101,000, 45,000, and 43,000) were purified by 
hydrophobic and ion-exchange chromatography with a high-performance liquid 
chromatographic system. Immunob lot ting analyses with monoclonal antibodies 
showed that the 101, 000-dalton fragment consisted of the light chain and a 
part of the heavy chain (H-l fragment) linked together by a disulfide bond, 
and the other two fragments were correlated to the remaining portion of the 
heavy chain (H-2 fragment). The 45,000- and 43 , 000-dalton fragments 
effectively competed for binding of the 1251-labeled neurotoxin to 
synaptosomes, while no inhibition was observed with the 101 , 000-dalton 
fragment. The results indicate that the H-2 fragment interacts with the 
binding site on the neural membrane. The binding of the neurotoxin was 
impaired by treatment of synaptosomes with neuraminidase. Incorporation of 
gangliosides into neuraminidase -treated synaptosomes resulted in the 
restoration of binding. The results suggest that gangliosides are one of 
the components of the toxin-binding site. 

Tags : Animal; Support, Non-U. S. Gov't 
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The complete amino acid sequence of the Clostridium botulinum type A 
neurotoxin, deduced by nucleotide sequence analysis of the encoding gene. 

Thompson D E ; Brehm J K; Oultram J D; Swinfield T J; Shone C C; 
Atkinson T; Melling J; Minton N P 

Division of Biotechnology, Centre for Applied Microbiology and Research, 
Porton Down, England. 

European journal of biochemistry / FEBS (GERMANY, WEST) Apr 20 1990 , 
189 (1) p73-81, ISSN 0014-2956 Journal Code: 0107600 
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A 26-mer oligonucleotide probe was synthesized (based on the determined 
amino acid sequence of the N- terminus of the Clostridium botulinum type A 
neurotoxin, BoNT/A) and used in Southern blot analysis to construct a 
restriction map of the region of the clostridial genome encompassing 
BoNT/A. The detailed information obtained enabled the cloning of the 
structural gene as three distinct fragments, none of which were capable of 
directing the expression of a toxic molecule. The central portion was 
cloned as a 2-kb PvuII-TaqI fragment and the remaining regions of the light 
chain and heavy chain as a 2,4-kb Scal-TaqI fragment and a 3.4-kb 
Hpal-PvuII fragment, respectively. The nucleotide sequence of all three 
fragments was determined and an open reading frame identified, composed of 
1296 codons corresponding to a polypeptide of 149 502 Da. The deduced amino 
acid sequence exhibited 33% similarity to tetanus toxin, with the most 
highly conserved regions occurring between the N-termini of the respective 
heavy chains. Conservation of Cys residues flanking the position at which 
the toxins are cleaved to yield the heavy chain and light chain allowed the 
tentative identification of those residues which probably form the 
disulphide bridges linking the two toxin subf ragments . 
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Botulinum neurotoxin type A: sequence of amino acids at the N- terminus 
and around the nicking site. 
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Food Research Institute, University of Wisconsin, Madison 53706. 
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Clostridium botulinum synthesizes the type A botulinum neurotoxin (NT) as 
a approximately 150 kDa single chain protein. Post-translational 
proteolytic processing yields a approximately 150 kDa dichain protein 
composed of a approximately 50 kDa light and approximately 100 kDa heavy 
chain , which has higher toxicity. Trypsin's action mimics the endogenous 
proteolytic processing. The proteolytic cleavages could occur at 4 sites. 
We have examined 2 such sites and defined the peptide sequences before and 
after proteolytic processing. The N- terminal residues of the newly 
synthesized approximately 150 kDa single chain NT, Pro-Phe-Val-Asn-Lys- , 
remain intact at the N-terminus of the approximately 50 kDa light chain 
generated either in the clostridial culture or in vitro with trypsin or 
with a protease purified from the homologous bacterial culture. The 
clostridial protease cleaves the single chain NT in vitro, at 1/3 the 
distance from its N-terminus, on the amino side of Gly of the sequence 
-Gly-Tyr-Asn-Lys-Ala-Leu-Asn-Asp-Leu- before cleaving the bond Lys-Ala at a 
slower rate. The data indicate that the dichain NT is formed in the 
bacterial culture in at least 2 steps. Cleavage at X-Gly produces a 
approximately 100 kDa heavy chain -like fragment which is then truncated; 
cleavage 4 residues downstream at Lys-Ala, and excision of the tetrapeptide 
Gly-Tyr-Asn-Lys, generates the mature heavy chain with Ala as its 
N-terminal residue. The approximately 100 kDa heavy chain generated in 
vitro, by nicking the single chain NT with trypsin, also has Ala-Leu-Asn- 
as the N-terminal residues. 
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Covalent structure of botulinum neurotoxin type A: location of sulfhydryl 
groups, and disulfide bridges and identification of C- termini of light and 
heavy chains . 
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Botulinum neurotoxin Type A is synthesized by Clostridium botulinum as a 
approximately 150 kD single chain polypeptide. The posttranslational 
processing of the 12 96 amino acid residue long gene product involves 
removal of the initiating methionine, formation of disulfide bridges, and 
limited proteolysis (nicking) by the bacterial protease (s). The mature 
dichain neurotoxin is made of a approximately 50 -kD light chain and a 
approximately 100 -kD heavy chain connected by a disulfide bridge. DNA 
derived amino acid sequence predicted a total of 9 Cys residues (Binz et 
al., 1990, J. Biol. Chem. 265, 9153-9158; Thompson et al . , 1990, Eur. J. 
Biochem. 189, 73-81) . Treatment of the dichain neurotoxin, dissolved in 6 M 
guanidine. HC1, with 4 -vinylpyridine converted 5 Cys residues into 
S-pyridylethyl cysteine residues; but alkylation after mercaptolysis 
converted all 9 Cys residues in the S-pyridylethylated form. After 
confirming the predicted number of Cys residues by amino acid analysis, the 
positions of the 5 Cys residues carrying sulfhydryl groups and the 4 
involved in disulfide bridges were determined by comparing the elution 
patterns in reversed-phase HPLC of the cyanogen bromide mixtures of the 
exclusively alkylated and the mercaptolyzed-alkylated neurotoxin. The 
chromatographically isolated components were identified by N-terminal amino 
acid sequence analysis. The HPLC patterns showed characteristic 
differences. The Cys residues predicted in positions 133, 164, 790, 966, 
and 1059 were found in the sulfhydryl form; Cys 429 and 453 were found 
disulf ide-bridge connecting the light and heavy chains, and Cys 1234 and 
1279 were found in an intrachain disulf ide-bridge near the C-terminus in 
the heavy chain. (ABSTRACT TRUNCATED AT 250 WORDS) 
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Botulinum neurotoxin type B [Precursor] 
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Bontoxilysin B 
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From 


Clostridium botulinum ITaxID: 14911 
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Comments 



FUNCTION. BOTULINUS TOXIN ACTS BY INHIBITING 
NEUROTRANSMITTER RELEASE. IT BINDS TO PERIPHERAL 
NEURONAL SYNAPSES, IS INTERNALIZED AND MOVES BY 
RETROGRADE TRANSPORT UP THE AXON INTO THE SPINAL CORD 
WHERE IT CAN MOVE BETWEEN POSTSYNAPTIC AND PRESYNAPTIC 
NEURONS. IT INHIBITS NEUROTRANSMITTER RELEASE BY ACTING 
AS A ZINC ENDOPEPTIDASE THAT CLEAVES THE 76-GLN-|-PHE-77 
BOND OF SYNAPTOBREVIN-2. 

CATALYTIC ACTIVITY. Limited hydrolysis of proteins of the 

neuroexocytosis apparatus, synaptobrevins, SNAP25 or syntaxin. No 

detected action on small molecule substrates. 

COF ACTOR. Binds 1 zinc ion per subunit (By similarity). 

SUBUNIT. DISULFIDE-LINKED HETERODIMER OF A LIGHT CHAIN 

(L) AND A HEAVY CHAIN (H). THE LIGHT CHAIN HAS THE 

PHARMACOLOGICAL ACTIVITY, WHILE THE N-AND C-TERMINAL OF 

THE HEAVY CHAIN MEDIATE CHANNEL FORMATION AND TOXIN 

BINDING, RESPECTIVELY. 

SUBCELLULAR LOCATION Secreted. 

MISCELLANEOUS. THERE ARE SEVEN ANTIGENIC ALLY DISTINCT 
FORMS OF BOTULINUM NEUROTOXIN: TYPES A, B, CI, D, E, F, AND 
G. 

SIMILARITY. BELONGS TO PEPTIDASE FAMILY M27. 



Copyright 



This SWISS-PROT entry is copyright. It is produced through a collaboration between the Swiss 
Institute of Bioinf ormatics and the EMBL outstation - the European Bioinformatics Institute. 
There are no restrictions on its use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial entities requires a license 
agreement (See http://www.isb-sib.ch/announce/ or send an email to license@isb-sib.ch ) 
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RFYESGIVFE 


1 

EYKDYFCISK 




1270 


1280 


1290 










1 

WYLKEVKRKP 


1 

YNLKLGCNWQ 


1 

FIPKDEGWTE 
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Search 
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for 



] 



Go 



Clear 



Swiss-Prot: BXB_CLOBO 

The section of the sequence BXB_CLOBO (P10844) you have selected 
corresponds to: 

SEQUENCE 1 44 



In one-letter code: 



1 11 21 

1 PVTINNFNYN DPIDNNNIIM MEPPFARGTG 
61 SSGIFNRDVC EYYDPDYLNT NDKKNIFLQT 
121 RRVPLEEFNT NIASVTVNKL ISNPGEVERK 
181 ASREGFGGIM QMKFCPEYVS VFNNVQENKG 
241 IKVDDLPIVP NEKKFFMQST DAIQAEELYT 
301 RLNKVLVCIS DPNININIYK NKFKDKYKFV 
361 AENYKIKTRA SYFSDSLPPV KIKNLLDNEI 
421 EEISKEHLAV YKIQMCKSVK APGICIDVDN 
481 IENDFPINEL ILDTDLISKI ELPSENTESL 
541 YSQTFPLDIR DISLTSSFDD ALLFSNKVYS 
601 VIEANKSNTM DKIADISLIV PYIGLALNVG 
661 WGAFLLESY IDNKNKIIKT IDNALTKRNE 
721 ALNYQAQALE EIIKYRYNIY SEKEKSNINI 
781 YLMKKMIPLA VEKLLDFDNT LKKNLLNYID 
841 IYTNDTILIE MFNKYNSEIL NNIILNLRYK 
901 TSSANSKIRV TQNQNIIFNS VFLDFSVSFW 
961 WKISIRGNRI IWTLIDINGK TKSVFFEYNI 
1021 LESNTDIKDI REVIANGEII FKLDGDIDRT 
1081 EYLKDFWGNP LMYNKEYYMF NAGNKNSYIK 
1141 GEKFIIRRKS NSQSINDDIV RKEDYIYLDF 
1201 DEFYNTIQIK EYDEQPTYSC QLLFKKDEES 
1261 WYLKEVKRKP YNLKLGCNWQ FIPKDEGWTE 



31 41 51 

RYYKAFKITD RIWIIPERYT FGYKPEDFNK 60 

MIKLFNRIKS KPLGEKLLEM IINGIPYLGD 12 0 

KGIFANLIIF GPGPVLNENE TIDIGIQNHF 180 

ASIFNRRGYF SDPALILMHE LIHVLHGLYG 24 0 

FGGQDPSIIT PSTDKSIYDK VLQNFRGIVD 3 00 

EDSEGKYSID VESFDKLYKS LMFGFTETNI 3 60 

YTIEEGFNIS DKDMEKEYRG QNKAINKQAY 42 0 

EDLFFIADKN SFSDDLSKNE RIEYNTQSNY 48 0 

TDFNVDVPVY EKQPAIKKIF TDENTIFQYL 54 0 

FFSMDYIKTA NKWEAGLFA GWVKQIVNDF 600 

NETAKGNFEN AFEIAGASIL LEFIPELLIP 660 

KWSDMYGLIV AQWLSTVNTQ FYTIKEGMYK 72 0 

DFNDINSKLN EGINQAIDNI NNFINGCSVS 780 

ENKLYLIGSA EYEKSKVNKY LKTIMPFDLS 840 

DNNLIDLSGY GAKVEVYDGV ELNDKNQFKL 900 

IRIPKYKNDG IQNYIHNEYT IINCMKNNSG 960 

REDISEYINR WFFVTITNNL NNAKIYINGK 102 0 

QFIWMKYFSI FNTELSQSNI EERYKIQSYS 1080 

LKKDSPVGEI LTRSKYNQNS KYINYRDLYI 114 0 

FNLNQEWRVY TYKYFKKEEE KLFLAPISDS 12 00 

TDEIGLIGIH RFYESGIVFE EYKDYFCISK 12 60 



In three-letter code: 



1 2 3 4 5 
1 Pro Val Thr lie Asn 
16 Asn Asn lie lie Met 
31 Arg Tyr Tyr Lys Ala 
46 Pro Glu Arg Tyr Thr 
61 Ser Ser Gly lie Phe 
76 Asp Tyr Leu Asn Thr 
91 Met lie Lys Leu Phe 



6 7 8 9 10 11 
Asn Phe Asn Tyr Asn Asp 
Met Glu Pro Pro Phe Ala 
Phe Lys lie Thr Asp Arg 
Phe Gly Tyr Lys Pro Glu 
Asn Arg Asp Val Cys Glu 
Asn Asp Lys Lys Asn lie 
Asn Arg lie Lys Ser Lys 



12 13 14 15 

Pro lie Asp Asn 15 

Arg Gly Thr Gly 3 0 

He Trp He He 45 

Asp Phe Asn Lys 60 

Tyr Tyr Asp Pro 75 

Phe Leu Gin Thr 90 

Pro Leu Gly Glu 105 
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106 


Lys 


Leu 


Leu 


Glu 


Met 


He 


He 


Asn 


Glv 


He 


Pro 


Tvr 
y 


Leu 


Glv 


Asp 


120 


121 






Val 


Pro 


Leu 


Glu 


Glu 


Phe 


Asn 


Thr 


Asn 


He 


Ala 


Ser 


Val 


135 


136 


Thr 


Val 


Asn 


Lvs 


Leu 


He 


Ser 


Asn 


Pro 


Gly 


Glu 


Val 


Glu 


Ara 


Lvs 


150 


151 


Lys 


Glv 


He 


Phe 


Ala 


Asn 


Leu 


He 


He 


Phe 


Gly 


Pro 


Glv 


Pro 


Val 


165 


166 


Leu 


Asn 


Glu 


Asn 


Glu 


Thr 


He 


Asp 


He 


Gly 


He 


Gin 


Asn 


His 


Phe 


180 


181 


Ala 


Ser 


Ara 


Glu 


Glv 


Phe 


Glv 


Glv 


He 


Met 


Gin 


Met 


Lys 


Phe 


Cys 


195 


196 


Pro 


Glu 


Tvr 


Val 


Ser 


Val 


Phe 


Asn 


Asn 


Val 


Gin 


Glu 


Asn 


Lvs 


Gly 

Jr 


210 


211 


Ala 


Ser 


He 


Phe 


Asn 


Arg 


Arg 


Glv 


Tvr 


Phe 


Ser 


Asp 


Pro 


Ala 


Leu 


225 


226 


lie 


Leu 


Met 


His 


Glu 


Leu 


He 


His 


Val 


Leu 


His 


Glv 


Leu 


Tvr 

4 


Glv 


240 


241 


He 


Lvs 


Val 


Asp 


ASP 


Leu 


Pro 


He 


Val 


Pro 


Asn 


Glu 


Lvs 


Lvs 


Phe 


255 


256 


Phe 


Met 


Gin 


Ser 


Thr 


Asp 


Ala 


He 


Gin 


Ala 


Glu 


Glu 


Leu 


Tvr 
y 


Thr 


270 


271 


Phe 


Glv 


Gly 


Gin 


ASP 


Pro 


Ser 


He 


He 


Thr 


Pro 


Ser 


Thr 


Asp 


Lvs 


285 


286 


Ser 


He 


Tvr 


ASD 


LVS 


Val 


Leu 


Gin 


Asn 


Phe 


Ara 


Gly 


He 


Val 


ASP 


300 


301 


Ara 


Leu 


Asn 


Lys 


Val 


Leu 


Val 


Cvs 


He 


Ser 


Asp 

""Jr 


Pro 


Asn 


He 


Asn 


315 


316 


He 


Asn 


He 


Tvr 
y 


Lvs 


Asn 


Lvs 


Phe 


Lvs 


Asp 


LVS 


Tyr 


Lvs 


Phe 


Val 


330 


331 


Glu 


Asp 


Ser 


Glu 


Glv 


Lys 


Tvr 

4 


Ser 


He 


Asp 


Val 


Glu 


Ser 


Phe 


Asp 


345 


346 


Lys 


Leu 


Tvr 
j 


Lys 


Ser 


Leu 


Met 


Phe 


Gly 


Phe 


Thr 


Glu 


Thr 


Asn 


He 


360 


361 


Ala 


Glu 


Asn 


Tvr 


Lys 


He 


Lvs 


Thr 


Ara 


Ala 


Ser 


Tvr 


Phe 


Ser 


Asp 


375 


376 


Ser 


Leu 


Pro 


Pro 


Val 


Lys 


He 


Lys 


Asn 


Leu 


Leu 


Asp 


Asn 


Glu 


He 


390 


391 


Tvr 


Thr 


He 


Glu 


Glu 


Gly 


Phe 


Asn 


He 


Ser 


Asp 


Lvs 


Asp 


Met 


Glu 


405 


406 


Lys 


Glu 


Tvr 


Aro 


Glv 

4 


Gin 


Asn 


Lys 


Ala 


He 


Asn 


Lvs 


Gin 


Ala 


Tvr 


420 


421 


Glu 


Glu 


He 


Ser 


Lys 


Glu 


His 


Leu 


Ala 


Val 


Tvr 


Lvs 


He 


Gin 


Met 


435 


436 


Cys 


Lys 


Ser 


Val 


Lys 


Ala 


Pro 


Glv 


He 


Cvs 


He 


Asp 


Val 


Asp 


Asn 


450 


451 


Glu 


Asp 


Leu 


Phe 


Phe 


He 


Ala 


Asp 


Lys 


Asn 


Ser 


Phe 


Ser 


Asp 


Asp 


465 


466 


Leu 


Ser 


Lys 


Asn 


Glu 


Ara 


He 


Glu 


Tvr 

4 


Asn 


Thr 


Gin 


Ser 


Asn 


Tvr 

Jr 


480 


481 


He 


Glu 


Asn 


Asp 


Phe 


Pro 


He 


Asn 


Glu 


Leu 


He 


Leu 


Asp 


Thr 


Asp 


495 


496 


Leu 


He 


Ser 


Lys 


He 


Glu 


Leu 


Pro 


Ser 


Glu 


Asn 


Thr 


Glu 


Ser 


Leu 


510 


511 


Thr 


Asp 


Phe 


Asn 


Val 


Asp 


Val 


Pro 


Val 


Tvr 


Glu 


Lys 


Gin 


Pro 


Ala 


525 


526 


He 


Lys 


Lys 


He 


Phe 


Thr 


Asp 


Glu 


Asn 


Thr 


He 


Phe 


Gin 


Tvr 
y 


Leu 


540 


541 


Tvr 


Ser 


Gin 


Thr 


Phe 


Pro 


Leu 


Asp 


He 


Arg 


Asp 


He 


Ser 


Leu 


Thr 


555 


556 


Ser 


Ser 


Phe 


Asp 


Asp 


Ala 


Leu 


Leu 


Phe 


Ser 


Asn 


Lys 


Val 


Tvr 
y 


Ser 


570 


571 


Phe 


Phe 


Ser 


Met 


Asp 


Tvr 

4 


He 


Lys 


Thr 


Ala 


Asn 


Lys 


Val 


Val 


Glu 


585 


586 


Ala 


Gly 


Leu 


Phe 


Ala 


Gly 


Trn 


Val 


Lys 


Gin 


He 


Val 


Asn 


Asp 


Phe 


600 


601 


Val 


He 


Glu 


Ala 


Asn 


Lys 


Ser 


Asn 


Thr 


Met 


Asp 


Lys 


He 


Ala 


Asp 


615 


616 


He 


Ser 


Leu 


He 


Val 


Pro 


Tvr 

4 


He 


Glv 


Leu 


Ala 


Leu 


Asn 


Val 


Glv 


630 


631 


Asn 


Glu 


Thr 


Ala 


Lvs 


Glv 


Asn 


Phe 


Glu 


Asn 


Ala 


Phe 


Glu 


He 


Ala 


645 


646 


Glv 


Ala 


Ser 


He 


Leu 


Leu 


Glu 


Phe 


He 


Pro 


Glu 


Leu 


Leu 


He 


Pro 


660 


661 


Val 


Val 


Glv 


Ala 


Phe 


Leu 


Leu 


Glu 


Ser 


Tvr 
y 


He 


Asp 


Asn 


Lvs 

y 


Asn 


675 


676 


Lys 


He 


He 


Lys 


Thr 


He 


Asp 


Asn 


Ala 


Leu 


Thr 


Lys 


Ara 


Asn 


Glu 


690 


691 


Lys 


TrD 

Jr 


Ser 


Asp 


Met 


Tvr 


Glv 


Leu 


He 


Val 


Ala 


Gin 


Trp 

tr 


Leu 


Ser 


705 


706 


Thr 


Val 


Asn 


Thr 


Gin 


Phe 


Tvr 

4 


Thr 


He 


Lys 


Glu 


Glv 


Met 


Tvr 
y 


Lys 


720 


721 


Ala 


Leu 


Asn 


Tvr 


Gin 


Ala 


Gin 


Ala 


Leu 


Glu 


Glu 


He 


He 


Lys 


Tvr 
y 


735 


736 




Tvr 


Asn 


He 


Tvr 


Ser 


Glu 


Lys 


Glu 


Lys 


Ser 


Asn 


He 


Asn 


He 


750 


751 


Asp 


Phe 


Asn 


Asp 


He 


Asn 


Ser 


Lvs 


Leu 


Asn 


Glu 


Gly 


He 


Asn 


Gin 


765 


766 


Ala 


He 


Asp 


Asn 


He 


Asn 


Asn 


Phe 


He 


Asn 


Glv 


Cys 


Ser 


Val 


Ser 


780 


781 


Tvr 


Leu 


Met 


Lys 


Lys 


Met 


He 


Pro 


Leu 


Ala 


Val 


Glu 


Lvs 


Leu 


Leu 


795 


796 


Asp 


Phe 


Asp 


Asn 


Thr 


Leu 


Lys 


Lys 


Asn 


Leu 


Leu 


Asn 


Tvr 
y 


He 


Asp 


810 


811 


Glu 


Asn 


Lys 


Leu 


Tvr 


Leu 


He 


Glv 


Ser 


Ala 


Glu 


Tvr 

y 


Glu 


Lys 


Ser 


825 


826 


Lys 


Val 


Asn 


Lys 


Tvr 

J. J A. 


Leu 


Lys 


Thr 


He 


Met 


Pro 


Phe 


Asp 


Leu 


Ser 


840 


841 


He 


Tvr 

J. J J. 


Thr 


Asn 


Asp 


Thr 


He 


Leu 


He 


Glu 


Met 


Phe 


Asn 


Lvs 


Tvr 

4 


855 


856 


Asn 


Ser 


Glu 


He 


Leu 


Asn 


Asn 


He 


He 


Leu 


Asn 


Leu 


Ara 


Tvr 

y 


Lys 


870 


871 


Asp 


Asn 


Asn 


Leu 


He 


Asp 


Leu 


Ser 


Glv 


Tvr 

4 


Glv 


Ala 


Lys 


Val 


Glu 


885 


886 


Val 


Tvr 

4 


Asp 


Glv 


Val 


Glu 


Leu 


Asn 


ASP 


LVS 


Asn 


Gin 


Phe 


Lvs 


Leu 


900 


901 


Thr 


Ser 


Ser 


Ala 


Asn 


Ser 


Lys 


He 


Ara 


Val 


Thr 


Gin 


Asn 


Gin 


Asn 


915 


916 


He 


He 


Phe 


Asn 


Ser 


Val 


Phe 


Leu 


Asp 


Phe 


Ser 


Val 


Ser 


Phe 


Trp 

A Jr 


930 


931 


He 


Arg 


He 


Pro 


Lys 


Tvr 


Lys 


Asn 


Asp 


Glv 


He 


Gin 


Asn 


Tvr 


He 


945 


946 


His 


Asn 


Glu 


Tvr 


Thr 


lie 


He 


Asn 


Cvs 


Met 


Lys 


Asn 


Asn 


Ser 


Glv 


960 


961 


Trn 


Lys 


He 


Ser 


He 


Ara 


Glv 


Asn 


Ara 


He 


He 


Tro 


Thr 


Leu 


He 


975 


976 


Asp 


He 


Asn 


Glv 


Lys 


Thr 


Lys 


Ser 


Val 


Phe 


Phe 


Glu 


Tvr 
y 


Asn 


He 


990 


991 


Arcr 


Glu 


Asp 


He 


Ser 


Glu 


Tvr 

4 


He 


Asn 


Ara 


Trp 

tr 


Phe 


Phe 


Val 


Thr 


1005 


1006 


He 


Thr 


Asn 


Asn 


Leu 


Asn 


Asn 


Ala 


Lys 


He 


Tvr 
y 


He 


Asn 


Glv 


Lys 


1020 


1021 


Leu 


Glu 


Ser 


Asn 


Thr 


Asp 


He 


Lvs 


Asp 


He 


Arq 


Glu 


Val 


He 


Ala 


1035 


1036 


Asn 


Gly 


Glu 


He 


He 


Phe 


Lys 


Leu 


Asp 


Gly 


Asp 


He 


Asp 


Arg 


Thr 


1050 


1051 


Gin 


Phe 


He 


Trp 


Met 


Lys 


Tyr 


Phe 


Ser 


He 


Phe 


Asn 


Thr 


Glu 


Leu 


1065 


1066 


Ser 


Gin 


Ser 


Asn 


He 


Glu 


Glu 


Arg 


Tyr 


Lys 


He 


Gin 


Ser 


Tyr 


Ser 


1080 
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1081 


Glu 


Tyr 


Leu 


Lys 


Asp 


Phe 


Trp 


Gly 


Asn 


Pro 


Leu 


Met 


Tyr 


Asn 


Lys 


1095 


1096 


Glu 


Tyr 


Tyr 


Met 


Phe 


Asn 


Ala 


Gly 


Asn 


Lys 


Asn 


Ser 


Tvr 


He 


Lvs 


1110 


1111 


Leu 


Lys 


Lys 


Asp 


Ser 


Pro 


Val 


Gly 


Glu 


He 


Leu 


Thr 


Arg 


Ser 


Lys 


1125 


1126 


Tyr 


Asn 


Gin 


Asn 


Ser 


Lys 


Tyr 


He 


Asn 


Tvr 


Ara 


Asp 


Leu 


Tyr 


He 


1140 


1141 


Gly 


Glu 


Lys 


Phe 


He 


He 


Arc? 


Arg 


Lys 


Ser 


Asn 


Ser 


Gin 


Ser 


He 


1155 


1156 


Asn 


Asp 


Asp 


He 


Val 


Arq 


Lys 


Glu 


Asp 


Tvr 


He 


Tyr 


Leu 


Asp 


Phe 


1170 


1171 


Phe 


Asn 


Leu 


Asn 


Gin 


Glu 


Trt) 


Arcr 


Val 


Tvr 


Thr 


Tvr 


Lvs 


Tvr 


Phe 


1185 


1186 


Lys 


Lys 


Glu 


Glu 


Glu 


Lys 


Leu 


Phe 


Leu 


Ala 


Pro 


He 


Ser 


Asd 


Ser 


1200 


1201 


Asp 


Glu 


Phe 


Tyr 


Asn 


Thr 


He 


Gin 


He 


Lvs 


Glu 


Tyr 


Asp 


Glu 


Gin 


1215 


1216 


Pro 


Thr 


Tyr 


Ser 


Cys 


Gin 


Leu 


Leu 


Phe 


Lvs 


Lvs 


Asp 


Glu 


Glu 


Ser 


1230 


1231 


Thr 


Asp 


Glu 


He 


Gly 


Leu 


He 


Gly 


He 


His 


Arg 


Phe 


Tyr 


Glu 


Ser 


1245 


1246 


Gly 


He 


Val 


Phe 


Glu 


Glu 


Tyr 


Lys 


Asp 


Tyr 


Phe 


Cys 


He 


Ser 


Lys 


1260 


1261 


Trp 


Tyr 


Leu 


Lys 


Glu 


Val 


Lys 


Arg 


Lys 


Pro 


Tyr 


Asn 


Leu 


Lys 


Leu 


1275 


1276 


Gly 


Cys 


Asn 


Trp 


Gin 


Phe 


He 


Pro 


Lys 


Asp 


Glu 


Gly 


Trp 


Thr 


Glu 
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ID CBBONT standard; DNA; PRO; 4041 BP. 

XX 

AC M81186; 
XX 

SV M81186.1 
XX 

DT 28-MAY-1992 (Rel. 32, Created) 

DT 04-MAR-2000 (Rel. 63, Last updated, Version 4) 

XX 

DE Clostridium botulinum neurotoxin type B (botB) gene, compl 
XX 

KW botB gene; neurotoxin type B. 



XX 

OS Clostridium botulinum 

OC Bacteria; Firmicutes; Clostridia; Clostridiales ; Clostridiaceae; 

OC Clostridium. 

XX 

RN [1] 

RP 1-4041 

RA Whelan S.M., Elmore M.J, Bodsworth N.J., Brehm J.K., Atkinson T. , 

RA Minton N.P. ; 

RT "Complete nucleotide sequence of the Clostridium botulinum gene encoding 

RT the type B neurotoxin"; 

RL Unpublished. 
XX 

DR GOA; P10844; P10844 . 

DR SWISS-PROT; P10844; BXB_CLOBO. 

XX 

FH Key Location/Qualifiers 
FH 

FT source 1. .4041 

FT /db_xref="taxon:14 91" 

FT /organism= "Clostridium botulinum" 

FT CDS 57 . .3932 

FT / codon_s t ar t = 1 

FT /db_xref = "GOA: P10844 " 

FT / db_xref = "SWISS-PROT : PI 0844 " 

FT / trans l_table= 11 

FT /function= "vertebrate neurotoxin" 

FT /product= "neurotoxin type B" 

FT /gene="botB H 

FT /protein_id="AAA23211.1" 

FT / translation= "MPVTINNFNYNDPIDNNNIIMMEPPFARGTGRYYKAFKITDRIWI 

FT I PERYTFGYKPEDFNKS SGI FNRDVCE YYDPD YLNTNDKKNI FLQTMI KLFNRI KSKPL 

FT GEKLLEMI INGI P YLGDRRVPLEEFNTNI AS VTVNKL I SNPGEVERKKGI FANLI I FGP 

FT GPVLNENETIDIGIQNHFASREGFGGIMQMKFCPEYVSVFNNVQENKGASIFNRRGYFS 

FT DPALILMHELIHVLHGLYGIKVDDLPIVPNEKKFFMQSTDAIQAEELYTFGGQDPSIIT 

FT PSTDKSIYDKVLQNFRGIVDRLNKVLVCISDPNININIYKNKFKDKYKFVEDSEGKYSI 

FT DVESFDKLYKSLMFGFTETNIAENYKIKTRASYFSDSLPPVKIKNLLDNEIYTIEEGFN 

FT ISDKDMEKEYRGQNKAINKQAYEEISKEHLAVYKIQMCKSVKAPGICIDVDNEDLFFIA 

FT DKNSFSDDLSKNERIEYNTQSNYIENDFPINELILDTDLISKIELPSENTESLTDFNVD 

FT / VPVYEKQPAIKKIFTDENTIFQYLYSQTFPLDIRDISLTSSFDDALLFSNKVYSFFSMD 

FT / YIKTANKWEAGLFAGWVKQIVNDFVIEANKSNTMDKIADISLIVPYIGLALNVGNETA 

FT^ KGNFENAFEIAGASILLEFIPELLIPWGAFLLESYIDNKNKIIKTIDNALTKRNEKWS 

FT DMYGLIVAQWLSTVNTQFYTIKEGMYKALNYQAQALEEIIKYRYNIYSEKEKSNINIDF 

FT NDINSKLNEGINQAIDNINNFINGCSVSYLMKKMIPLAVEKLLDFDNTLKKNLLNYIDE 

FT NKLYLIGSAEYEKSKVNKYLKTIMPFDLSIYTNDTILIEMFNKYNSEILNWIILNLRYK 

FT DNNLIDLSGYGAKVEVYDGVELNDKNQFKLTSSANSKIRVTQNQNIIFNSVFLDFSVSF 

FT WIRIPKYKNDGIQNYIHNEYTIINCMKNNSGWKISIRGNRIIWTLIDINGKTKSVFFEY 

FT NIREDISEYINRWFFVTITNNLNNAKIYINGKLESNTDIKDIREVIANGEIIFKLDGDI 

FT DRTQFIWMKYFSIFNTELSQSNIEERYKIQSYSEYLKDFWGNPLMYNKEYYMFNAGNKN 

FT SYIKLKKDSPVGEILTRSKYNQNSKYINYRDLYIGEKFIIRRKSNSQSINDDIVRKEDY 

FT IYLDFFNLNQEWRVYTYKYFKKEEEKLFLAPISDSDEFYNTIQIKEYDEQPTYSCQLLF 

FT KKDEESTDEIGLIGIHRFYESGIVFEEYKDYFCISKWYLKEVKRKPYNLKLGCNWQFIP 

FT KDEGWTE" 
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XX 

SQ Sequence 4041 BP; 1679 A; 383 C; 645 G; 1334 T; 0 other; 

tgcgcattta tgggcattaa aagggatata aacttaaaat aaggaggaga atatttatgc 60 

cagttacaat aaataatttt aattataatg atcctattga taataataat attattatga 120 

tggagcctcc atttgcgaga ggtacgggga gatattataa agcttttaaa atcacagatc 180 

gtatttggat aataccggaa agatatactt ttggatataa acctgaggat tttaataaaa 240 

gttccggtat ttttaataga gatgtttgtg aatattatga tccagattac ttaaatacta 300 

atgataaaaa gaatatattt ttacaaacaa tgatcaagtt atttaataga atcaaatcaa 360 

aaccattggg tgaaaagtta ttagagatga ttataaatgg tataccttat cttggagata 420 

gacgtgttcc actcgaagag tttaacacaa acattgctag tgtaactgtt aataaattaa 480 

tcagtaatcc aggagaagtg gagcgaaaaa aaggtatttt cgcaaattta ataatatttg 540 

gacctgggcc agttttaaat gaaaatgaga ctatagatat aggtatacaa aatcattttg 600 

catcaaggga aggcttcggg ggtataatgc aaatgaagtt ttgcccagaa tatgtaagcg 660 

tatttaataa tgttcaagaa aacaaaggcg caagtatatt taatagacgt ggatattttt 720 

cagatccagc cttgatatta atgcatgaac ttatacatgt tttacatgga ttatatggca 780 

ttaaagtaga tgatttacca attgtaccaa atgaaaaaaa attttttatg caatctacag 840 

atgctataca ggcagaagaa ctatatacat ttggaggaca agatcccagc atcataactc 900 

cttctacgga taaaagtatc tatgataaag ttttgcaaaa ttttagaggg atagttgata 960 

gacttaacaa ggttttagtt tgcatatcag atcctaacat taatattaat atatataaaa 1020 

ataaatttaa agataaatat aaattcgttg aagattctga gggaaaatat agtatagatg 1080 

tagaaagttt tgataaatta tataaaagct taatgtttgg ttttacagaa actaatatag 1140 

cagaaaatta taaaataaaa actagagctt cttattttag tgattcctta ccaccagtaa 1200 

aaataaaaaa tttattagat aatgaaatct atactataga ggaagggttt aatatatctg 1260 

ataaagatat ggaaaaagaa tatagaggtc agaataaagc tataaataaa caagcttatg 1320 

aagaaattag caaggagcat ttggctgtat ataagataca aatgtgtaaa agtgttaaag 1380 

ctccaggaat atgtattgat gttgataatg aagatttgtt ctttatagct gataaaaata 1440 

gtttttcaga tgatttatct aaaaacgaaa gaatagaata taatacacag agtaattata 1500 

tagaaaatga cttccctata aatgaattaa ttttagatac tgatttaata agtaaaatag 1560 

aattaccaag tgaaaataca gaatcactta ctgattttaa tgtagatgtt ccagtatatg 1620 

aaaaacaacc cgctataaaa aaaattttta cagatgaaaa taccatcttt caatatttat 1680 

actctcagac atttcctcta gatataagag atataagttt aacatcttca tttgatgatg 1740 

cattattatt ttctaacaaa gtttattcat ttttttctat ggattatatt aaaactgcta 1800 

ataaagtggt agaagcagga ttatttgcag gttgggtgaa acagatagta aatgattttg 1860 

taatcgaagc taataaaagc aatactatgg ataaaattgc agatatatct ctaattgttc 1920 

cttatatagg attagcttta aatgtaggaa atgaaacagc taaaggaaat tttgaaaatg 1980 

cttttgagat tgcaggagcc agtattctac tagaatttat accagaactt ttaatacctg 2040 

tagttggagc ctttttatta gaatcatata ttgacaataa aaataaaatt attaaaacaa 2100 

tagataatgc tttaactaaa agaaatgaaa aatggagtga tatgtacgga ttaatagtag 2160 

cgcaatggct ctcaacagtt aatactcaat tttatacaat aaaagaggga atgtataagg 2220 

ctttaaatta tcaagcacaa gcattggaag aaataataaa atacagatat aatatatatt 2280 

ctgaaaaaga aaagtcaaat attaacatcg attttaatga tataaattct aaacttaatg 2340 

agggtattaa ccaagctata gataatataa ataattttat aaatggatgt tctgtatcat 2400 

atttaatgaa aaaaatgatt ccattagctg tagaaaaatt actagacttt gataatactc 2460 

tcaaaaaaaa tttgttaaat tatatagatg aaaataaatt atatttgatt ggaagtgcag 2520 

aatatgaaaa atcaaaagta aataaatact tgaaaaccat tatgccgttt gatctttcaa 2580 

tatataccaa tgatacaata ctaatagaaa tgtttaataa atataatagc gaaattttaa 2640- 

ataatattat cttaaattta agatataagg ataataattt aatagattta tcaggatatg 2700 

gggcaaaggt agaggtatat gatggagtcg agcttaatga taaaaatcaa tttaaattaa 2760 

ctagttcagc aaatagtaag attagagtga ctcaaaatca gaatatcata tttaatagtg 2820 

tgttccttga ttttagcgtt agcttttgga taagaatacc taaatataag aatgatggta 2880 

tacaaaatta tattcataat gaatatacaa taattaattg tatgaaaaat aattcgggct 2940 

ggaaaatatc tattaggggt aataggataa tatggacttt aattgatata aatggaaaaa 3000 

ccaaatcggt attttttgaa tataacataa gagaagatat atcagagtat ataaatagat 3060 

ggttttttgt aactattact aataatttga ataacgctaa aatttatatt aatggtaagc 3120 

tagaatcaaa tacagatatt aaagatataa gagaagttat tgctaatggt gaaataatat 3180 

ttaaattaga tggtgatata gatagaacac aatttatttg gatgaaatat ttcagtattt 3240 

ttaatacgga attaagtcaa tcaaatattg aagaaagata taaaattcaa tcatatagcg 3300 

aatatttaaa agatttttgg ggaaatcctt taatgtacaa taaagaatat tatatgttta 3360 

atgcggggaa taaaaattca tatattaaac taaagaaaga ttcacctgta ggtgaaattt 3420 

taacacgtag caaatataat caaaattcta aatatataaa ttatagagat ttatatattg 3480 

gagaaaaatt tattataaga agaaagtcaa attctcaatc tataaatgat gatatagtta 3540 

gaaaagaaga ttatatatat ctagattttt ttaatttaaa tcaagagtgg agagtatata 3600 

cctataaata ttttaagaaa gaggaagaaa aattgttttt agctcctata agtgattctg 3660 

atgagtttta caatactata caaataaaag aatatgatga acagccaaca tatagttgtc 3720 

agttgctttt taaaaaagat gaagaaagta ctgatgagat aggattgatt ggtattcatc 3780 
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gtttctacga atctggaatt gtatttgaag agtataaaga ttatttttgt ataagtaaat 3840 

ggtacttaaa agaggtaaaa aggaaaccat ataatttaaa attgggatgt aattggcagt 3 900 

ttattcctaa agatgaaggg tggactgaat aatataacta tatgctcagc aaacctattt 3960 

tatataagaa aagtttaagt ttataaaatc ttaagtttaa ggatgtagct aaattttgaa 4020 

tattagataa actacatgtt t 4041 
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